Optimization of variance-stabilizing transformations
نویسنده
چکیده
Variance-stabilizing transformations are commonly exploited in order to make non-homoskedastic data easily tractable by standard methods. However, for the most common families of distributions (e.g., binomial, Poisson, etc.) exact stabilization is not possible and even achieving some approximate stabilization turns out to be rather challenging. We approach the variance stabilization problem as an explicit optimization problem and propose recursive procedures to minimize a nonlinear stabilization functional that measures the discrepancy between the standard deviation of the transformed variables and a xed desired constant. Further, we relax the typical requirement of monotonicity of the transformation and introduce optimized nonmonotone stabilizers which are nevertheless invertible in terms of expectations. We demonstrate a number of optimized variancestabilizing transformations for the most common distribution families. These stabilizers are shown to outperform the existing ones. In particular, optimized variance-stabilizing transformations for low-count Poisson, binomial, and negative-binomial data are presented.
منابع مشابه
Approximate Variance-stabilizing Transformations for Gene-expression Microarray Data
MOTIVATION A variance stabilizing transformation for microarray data was recently introduced independently by several research groups. This transformation has sometimes been called the generalized logarithm or glog transformation. In this paper, we derive several alternative approximate variance stabilizing transformations that may be easier to use in some applications. RESULTS We demonstrate...
متن کاملClassroom Simulation: Are Variance-stabilizing Transformations Really Useful
When population variances of observations in an ANOVA are a known function of their population means, many textbooks recommend using variancestabilizing transformations. Examples are: square root transformation for Poisson data, arcsine of square root for binomial proportions, and log for exponential data. We investigate the usefulness of transformations in onefactor, 3-level ANOVAs with nonnor...
متن کاملOn Bootstrap Procedures for Second-order Accurate Confidence Limits in Parametric Models
This paper concerns the use of simulation procedures to construct secondorder accurate con dence limits having coverage error of order O(n ). An explicit formula for the analytical adjustment required in Efron's (1987) BCa percentile method is derived, automatic percentile methods that do not require analytical adjustments are proposed, and variance-stabilizing transformations designed to impro...
متن کاملVariance-stabilizing and Confidence-stabilizing Transformations for the Normal Correlation Coefficient with Known Variances
Fosdick and Raftery (2012) revisited the classical problem of inference for a bivariate normal correlation coefficient ρ when the variances are known. They considered several frequentist and Bayesian estimators, the former including the maximum likelihood estimator (MLE), but did not obtain the standard errors of these estimators or confidence intervals for ρ. Here we present a new variance-sta...
متن کاملTransforming RNA-Seq Data to Improve the Performance of Prognostic Gene Signatures
Gene expression measurements have successfully been used for building prognostic signatures, i.e for identifying a short list of important genes that can predict patient outcome. Mostly microarray measurements have been considered, and there is little advice available for building multivariable risk prediction models from RNA-Seq data. We specifically consider penalized regression techniques, s...
متن کامل